Optimizing Feature Representation for Automated Systematic Review Work Prioritization

نویسنده

  • Aaron M. Cohen
چکیده

Automated document classification can be a valuable tool for enhancing the efficiency of creating and updating systematic reviews (SRs) for evidence-based medicine. One way document classification can help is in performing work prioritization: given a set of documents, order them such that the most likely useful documents appear first. We evaluated several alternate classification feature systems including unigram, n-gram, MeSH, and natural language processing (NLP) feature sets for their usefulness on 15 SR tasks, using the area under the receiver operating curve as a measure of goodness. We also examined the impact of topic-specific training data compared to general SR inclusion data. The best feature set used a combination of n-gram and MeSH features. NLP-based features were not found to improve performance. Furthermore, topic-specific training data usually provides a significant performance gain over more general SR training.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Research Paper: Cross-Topic Learning for Work Prioritization in Systematic Review Creation and Update

OBJECTIVE Machine learning systems can be an aid to experts performing systematic reviews (SRs) by automatically ranking journal articles for work-prioritization. This work investigates whether a topic-specific automated document ranking system for SRs can be improved using a hybrid approach, combining topic-specific training data with data from other SR topics. DESIGN A test collection was b...

متن کامل

Research prioritization of men’s health and urologic diseases

OBJECTIVES We sought to determine whether disease representation in the Cochrane Database of Systematic Reviews (CDSR) reflects disease burden, measured by the Global Burden of Disease (GBD) Study as disability-adjusted life-years (DALYs). MATERIALS AND METHODS Two investigators performed independent assessment of ten men's health and urologic diseases (MHUDs) in CDSR for systematic review an...

متن کامل

Ethical Patient Prioritization in Disaster Triage: A Protocol for a Systematic Review

Background: Disasters are medically defined as events in which the demands for patients’ care far exceed the available resources. In such situations, triage and rationing of limited resources are inevitable. A decision regarding triage needs not only scientific guidelines but also an ethical framework and supporting policies. This study aims to provide a comprehensive review of the criteria for...

متن کامل

Identifying gaps in research prioritization: The global burden of neglected tropical diseases as reflected in the Cochrane database of systematic reviews

BACKGROUND Neglected tropical diseases (NTDs) impact disadvantaged populations in resource-scarce settings. Availability of synthesized evidence is paramount to end this disparity. The aim of the study was to determine whether NTD systematic reviews or protocols in the Cochrane Database of Systematic Reviews (CDSR) reflect disease burden. METHODS Two authors independently searched the CDSR fo...

متن کامل

Anomaly Detection Using SVM as Classifier and Decision Tree for Optimizing Feature Vectors

Abstract- With the advancement and development of computer network technologies, the way for intruders has become smoother; therefore, to detect threats and attacks, the importance of intrusion detection systems (IDS) as one of the key elements of security is increasing. One of the challenges of intrusion detection systems is managing of the large amount of network traffic features. Removing un...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AMIA ... Annual Symposium proceedings. AMIA Symposium

دوره   شماره 

صفحات  -

تاریخ انتشار 2008